Building part compositions for hierarchical object recognition
نویسنده
چکیده
The visual signature of objects in our world follow a particular blueprint: every object is composed of an arrangement of smaller visual "parts". These parts can be thought of on many different scales. An image of an car can be considered to be pattern of wheel, window, door, and various other compositional parts, or on a much more granular scale as a massive pattern of interconnected line segment parts. These levels of granularity lend themselves well to a hierarchical organization of visual blueprints: the car can be decomposed into doors and other similarly-scaled parts, each door can be decomposed into a window, a handle, and other features, and this reduction in scale can continue until a suitable base level of elemental parts is reached. In their 2007 paper, "Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts" [3], Fidler and Leonardis described an algorithm to implement such a categorization scheme with minimal human intervention. Given a large corpus of images, they showed their algorithm organizing subparts into patterns completely automatically from the bottom elemental layer to parts three layers higher. At this level, parts became too specialized to learn with a generic image corpus, so further learning was done on manually separated sets of images grouped by categories (e.g. faces, mugs, etc.). According to the authors, this was the only source of significant human intervention in the process. This work has attempted to recreate their algorithm from scratch. Unfortunately, this task was not successfully completed. Although the entire framework of the algorithm was written, automating the core pattern processing part of it proved to be a very nuanced and difficult task to complete. This paper will discuss the nature of this difficulty and describe the various successes and setbacks in developing a principled way to process the patterns in the data automatically.
منابع مشابه
Urban Vegetation Recognition Based on the Decision Level Fusion of Hyperspectral and Lidar Data
Introduction: Information about vegetation cover and their health has always been interesting to ecologists due to its importance in terms of habitat, energy production and other important characteristics of plants on the earth planet. Nowadays, developments in remote sensing technologies caused more remotely sensed data accessible to researchers. The combination of these data improves the obje...
متن کاملLearning Hierarchical Representations of Object Categories for Robot Vision
This paper presents our recently developed approach to constructing a hierarchical representation of visual input that aims to enable recognition and detection of a large number of object categories. Inspired by the principles of efficient indexing, robust matching, and ideas of compositionality, our approach learns a hierarchy of spatially flexible compositions, i.e. parts, in an unsupervised,...
متن کاملHierarchical Spectro-Temporal Models for Speech Recognition
We seek to explore computational approaches for audition that are inspired by computational visual neuroscience. In particular, we seek to leverage recent progress over the past few years in building a biologically-faithful hierarchical, feed-forward system for visual object recognition [13,14]. The system, which was designed to closely match the currently known feed-forward path in the ventral...
متن کاملLearning Hierarchical Sparse Representations using Iterative Dictionary Learning and Dimension Reduction
This paper introduces an elemental building block which combines Dictionary Learning and Dimension Reduction (DRDL). We show how this foundational element can be used to iteratively construct a Hierarchical Sparse Representation (HSR) of a sensory stream. We compare our approach to existing models showing the generality of our simple prescription. We then perform preliminary experiments using t...
متن کاملEfficient Clustering and Matching for Object Class Recognition
In this paper we address the problem of building object class representations based on local features and fast matching in a large database. We propose an efficient algorithm for hierarchical agglomerative clustering. We examine different agglomerative and partitional clustering strategies and compare the quality of obtained clusters. Our combination of partitional-agglomerative clustering give...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009